THE SPECTRUM OF RANDOM LIFTS 
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Abstract. For a fixed d-rcgular graph H, a random n-lift is obtained by replacing 
each vertex v oi H hy & "fibre" containing n vertices, then placing a uniformly 
random matching between fibres corresponding to adjacent vertices of H . We 
show that with extremely high probability, all eigenvalues of the lift that are not 
eigenvalues of H , have order 0{y/d). In particular, if H is Ramanujan then its n-lift 
is with high probability nearly Ramanujan. We also show that any exceptionally 
large eigenvalues of the n-lift that are not eigenvalues of H , are overwhelmingly 
likely to have been caused by a dense subgraph of size 0{\E{H)\). 



1. Introduction 



o 
u 

^ ■ Expander graphs, graphs in which all small subsets exhibit good expansion prop- 

+^ . erties, are intriguing objects of study that arise in such diverse fields as number 

theory, computer science and discrete geometry. As Hoory, Linial and Wigderson 
remark in their wide ranging survey on expanders [12], one reason for their ubiquity 
is that they may be defined in at least three languages: combinatorial/geometric, 
^ . probabilistic and algebraic. We refer the reader to this survey and to the expository 

\ article of Sarnak [21] for more background on expander graphs and their applica- 

^ ■ tions. 



We shall be concerned, almost exclusively, with d-regular graphs, in which ev- 
ery vertex has exactly d neighbours. From the algebraic viewpoint, which we take 
throughout, a ci- regular graph G is an expander if there is a significant gap between 
A(G), the largest eigenvalue of the adjacency matrix of G (for a rf-regular graph 
this value is always d) and A2(G), the largest modulus of any other eigenvalue. 
^ ! Classic results of Dodziuk, Alon-Milman and Alon [1, 2, 9] show that the differ- 

I ence d — X2{G) controls the combinatorial expansion of G. More precisely, writing 

h{G) = mins \E{S, S'^)\/\S\, where E{S, S"^) is the number of edges from S to S^, its 
complement in V, and where the minimum is taken over subsets S of the vertices of 
G with l^l < |5"^|, we have 

^ < h{G) < ^2d{d-UG)). 



Given the theorem of Alon and Boppana [2, 19] that A2(G') > 2\/ d — 1 — o„(l) 
for every d-regular graph with n vertices, it is particularly significant if A2(G') < 
2\/ d — 1, in which case the graph is said to be Ramanujan. A major open problem is 
to prove the existence of infinite families of d-regular Ramanujan graphs for all d > 3. 
Explicit constructions coming from number theory given by Lubotzky, Phillips and 
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Sarnak [16] and Margulis [17] for the case that d — 1 is prime represented a major 
breakthrough. Morgenstern [18] gives examples of such families whenever d — 1 is a 
prime power. However, it seems unlikely that number theoretic approaches will be 
successful in resolving the problem in its full generality. 

A combinatorial approach to the problem, initiated by Friedman [10], is to prove 
that one may obtain new (larger) Ramanujan graphs from smaller ones. In this 
approach one starts with a base graph H which one "lifts" to obtain a larger graph 
G which covers the original graph H in the sense that there is a homomorphism 
from G to H such that all fibres in G of vertices of H are of equal size. If G is a 
cover of H and the fibres in G of vertices in H have size k, then G is called a /c-lift 
of H. 

It is easily observed that the lift G inherits all the eigenvalues of the base graph 
H. Indeed, let fi be an eigenvalue of H with eigenvector x, and define a vector y 
with entries indexed by V{G) by setting, for each i, i/y = Xi for all vertices v E Vf, 
then y is an eigenvector of G with eigenvalue /i. In fact these lifted eigenvectors 
of H span the space of all vectors that are constant on each of the fibres Vi of the 
lift. The remaining eigenvalues of G are referred to as the new eigenvalues of the 
lift (note however that it is possible for some new eigenvalues to be equal to "old" 
eigenvalues). Since eigenvectors of symmetric real matrices corresponding to distinct 
eigenvalues are orthogonal, these are exactly the set of eigenvalues which have an 
eigenvector x which is balanced on each fibre (i.e. for which Ylvev = for all 
i e V{H)). Since the base graph is given it suffices to concentrate our study on 
the new eigenvalues of G. We denote by X*{G) the largest absolute value of a new 
eigenvalue of G. (For the remainder of the paper, for any graph F we denote by 
X{F) the largest eigenvalue of F.) 

A random n-lift G of a graph H is obtained by assigning to each vertex i oi H 
a distinct set Vi of n vertices, and placing a random matching (i.e. one chosen 
uniformly at random from the n\ possibilities) between Vi and Vj for each edge ij 
of H. Random lifts were introduced by Amit, Linial, Matousek and Rozenman 
[6]. In that article a variety of properties of random lifts are discussed, related 
to connectivity, expansion, independent sets, colouring and perfect matchings; the 
proofs of these results, and others, were developed in several subsequent papers 
[3-5, 14]. As remarked in [10], any finite cover F of if in which fibres have size 
n has a positive probability of appearing as G (in fact this probability is precisely 
(^72!)l^(^)~'^(^)l (Aut(F//J))^\ where Aut(F/_f/') is the group of automorphisms of 
F over H), and so such random lifts form a "seemingly reasonable model of a 
probabilistic space of finite quotients of [the infinite rf-ary tree]". 

Although very few graphs are known to be Ramanujan, it is conjectured that 
a positive proportion of regular graphs are in fact Ramanujan, and Alon's conjec- 
ture/Friedman's theorem states that for any e > 0, only an asymptotically negligible 
proportion of d-regular graphs have \2{G) > 2y/d — 1 + e. In this spirit, Friedman 
[10] studied the eigenvalues of random lifts of regular graphs, and Lubetzky, Su- 
dakov and Vu [15] conjectured that a random lift of a Ramanujan graph has a 
positive probability of being Ramanujan. Since for all d the complete graph K^+i 
is a d-regular Ramanujan graph, this would imply the existence of arbitrarily large 
(i-regular Ramanujan graphs. In the terminology from [10], the main result of this 
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paper implies that with extremely high probability the lifts of Ramanujan graphs 
are 0(-\/d)- weakly Ramanujan, in that all non-trivial eigenvalues are 0{\/d). 

In [10], Friedman used the trace method of Wigner to prove results which in 
particular imply that if H is ci-regular then \*{G) = 0{d^^^) whp^. This was later 
tightened to 0{d'^^^) by Linial and Puder, by a careful analysis of the trace method. 
They also made a conjecture concerning word maps, which if verified would prove 
X*{G) = 0{d^/^) whp. Two initial cases of the conjecture were proved, a third has 
been proven more recently by Lui and Puder [13]. Bilu and Linial [7] then showed 
that every rf-regular graph H has some 2-lift G with \*[G) = O^d^^"^ log^^'^ d). The 
next major step in this area was taken by Lubetzky, Sudakov and Vu [15], who 
proved that \*{G) = 0(max(\2{H), d^^'^)\ogd) whp. In particular, in the case that 
\^(H) = 0{d^/^) this result gives that \*{G) = 0{d^/Hogd). 

In this article we prove that whp \*(G) = 0{d^^'^), a result which is best possible 
up to the constant. 

Theorem 1. Let H be any d-regular graph and let G he a random n-lift of H. For 
all n sufficiently large, with probability at least 1 — n~'^'^^^^ , X*{G) < A30Q56y/d. 

Furthermore, we are able to explain the likely cause of large eigenvalues should 
they occur. This cause is, with very high probability, a small (i.e. of size not de- 
pending on n) subgraph of G. 

Theorem 2. Let H be any d-regular graph, write h = \V{H)\, and let G be a random 
n-lift of H . For all n sufficiently large, with probability at least 1 — n~^'^, G contains 
an induced subgraph G' with at most hd vertices, such that \*{G) < 1189248A(G"). 

One might protest that the eigenvalue of G' is not necessarily a new eigenvalue of 
G, and so G' is not the 'cause' of a new eigenvalue of large modulus in G. However, 
the following approximate converse to Theorem 2 justifies our use of such an epithet 
for G'. 

Proposition 3. For any induced subgraph G' of G with \V{G')\ < n — h^fn, we 
have \\G) > \{G') - 7/2. 

The short proof of Proposition 3 appears in Section 7. 

Costello and Vu [8] remark that "[t]he main intuition that underlies many prob- 
lems concerning the rank of a random matrix is that dependency should come from 
small configurations," and their paper can be seen as confirmation of this intuition 
for a rather broad class of random matrices. In this spirit. Theorem 2 should be 
viewed as stating that for random lifts, any exceptionally large eigenvalues come 
from small configurations. It would be very interesting to know whether our "ex- 
ceptionally large" can be replaced by "slightly large". A rather ambitious question 
one could ask in this direction is the following. 

Question 4. Is there a constant G not depending on n (perhaps depending on H) 
such that for any e > 0, given that X*{G) > (2-\-e)y/d — 1, with probability 1 — o„(l), 
G contains a subgraph G' with at most G vertices such that A(G") > (2 + e/2)vw— 1 ? 



If a statement holds with probabiUty which tends to 1 as n tends to infinity, we say that it occurs 
'with high probabihty', or 'whp' for short. 
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We note that the probabihty bound in Theorem 2 is extremely strong. Indeed, 
the failure probability, n~^'^ is much smaller than the probability that G contains H 
as a subgraph - the probability of the latter event is greater than n"'*'^/^ - in which 
case \*{G) = d. 

Our proof, like that of Lubetzky, Sudakov and Vu [15] and many others, relies on 
reducing an uncountable collection of possible 'reasons' for a large eigenvalue to a 
finite (and hopefully relatively small) sub-collection which still express all ways in 
which a large eigenvalue can occur. They used the well-known method of e-nets to 
make this reduction. (Amit and Linial [5] also used e-nets to prove a lower bound 
on edge expansion for random lifts of connected graphs which need not necessarily 
be regular.) However, the number of events (points of the e-net) one is required to 
consider is exp(6(n/ilog(i)). The appearance of the \ogd here is a major obstacle 
to proving that X*{G) = 0{\/d) by the e-net approach. Our approach is based 
on a convexity argument that allows us to reduce to a smaller, exp(0(n/i))-sized 
family of events. Furthermore, the events in this collection are concerned with 
vectors with dyadic entries. These are easier to deal with than general vectors; in 
particular, direct combinatorial arguments may be applied and we need not appeal 
to martingale inequalities to obtain probability bounds. 

Modulo a few trivial changes to our proof (e.g. changing "precisely" to "at most" 
in the proof of Proposition 8, below) replacing d hj A throughout gives a proof of 
the following generalisation to the case that the base graph H is not regular. 

Theorem 5. Let H be any graph of maximum degree A and let G be a random 
n-lift of H . For all n sufficiently large, with probability at least 1 —n^'^^^^^, X*(G) < 
430656a/A, and with probability at least 1 — n~^^, G contains an induced subgraph 
G' with at most HA vertices, such that X*{G) < 1189248A(G')- 

Since G will always contain two edge disjoint stars whose centres have degree A 
and lie in the same fibre, X*{G) > \/A. Thus this result is also tight up to the 
constant. 

While we focus here on the case of an n-lift where n is large we would like to bring 
to the reader's attention the recent result of Oliveira [20] that a random n-lift G of a 
graph H on h vertices with maximum degree A satisfies X*{G) = 0{A^/'^ log^ ^\hn)) 
whp. 



2. Notation 

All logarithms in this paper are natural logarithms unless otherwise specified. 
For positive integers k we write [k] = {!,..., k}. We write No = {0, 1,2,.. .} and 
N = {1,2,.. .}. For any graph F = (V, E) and u,v ^ V we write u v if uv ^ E. 
For the remainder of the paper, d > 2 is a positive integer, H = {[h], E{H)) is 
a fixed d-regular graph, and G = {V{G), E{G)) is the random n-lift of H, where 
V{G) = {{i,j),i G [h],j G [n]}. For i in [h] we write Vi = : j G [n]}, and call 

Vi the fibre of i in G. Let M be the adjacency matrix of G. 



THE SPECTRUM OF RANDOM LIFTS 



5 



For any m-hj-m matrix A = {auv)u,velm] and vectors x,y E M™, we write {x, y)^ = 
SuueH -^uduvyv Also, for a set C M^, we write {x,y)y^^ for the restricted sum 

^ ^ Xu^uvyv 

Finally, we use the Vinogradov notation f <^ g to mean that / = 0{g), i.e. / is 
bounded by a constant times g, independent of n. We write / x (7 to mean that 
f = Oig) and^7 = 0(/). 

3. An overview of the proof 

As in [11] and much subsequent work, we will bound the eigenvalues of G using 
the Rayleigh quotient principle, which is to say by bounding (x, x)^^ for suitable 
vectors x. More precisely, writing X = {x E R^^"^) : ||x||2 < 1}, Rayleigh's quotient 
principle tells us the following. 

Fact 6. \*{G) = sup{| {x,x)j^,j \ : x e X,\f i e [h], Xlj^ev, Xv = 0}. 

Fact 6 forms the basis for our study of \{G). However, to make use of it we first 
need to have an idea of the diversity of possible ways in which G could admit a 
balanced vector (a vector satisfying J2v&Vi = ^ for all i G [h]) having a large value 
for {x, x) j^. To begin to get a feel for this, we now give two rather different examples 
of how \*{G) can be large. For simplicity, for the examples we assume that H is the 
complete graph K^+i- We also provide bounds on the probability of such examples 
occurring in G. These bounds give something of the flavour of the bounds we shall 
be required to prove for the general case. 

Example 1: G contains a large-ish clique. 

Fix s G N and vertices Wi, . . . , t;^ of G in distinct fibres. If G'[{f 1, . . . ^Vs}] happens 
to be a clique, then setting Xu to be 1 if m G {vi, . . . , w^}, —l/{n — 1) for all other 
vertices in the same fibres as Wi, . . . , w^, and zero on all other vertices, we obtain a 
vector X, balanced on each fibre of H and for which Mx = (s — l)x. Thus, if G 
contains a clique of order at least K\fd+ 1 then \*{G) > Ky/d. 

To bound the probability that G contains such a clique, for each possible choice of 
Vi, . . . ,Vs the probability that 1, . . . , f^}] is a chque is n~^'^ (since each edge ViVii 
is present in G independently with probability n~^). Since there are only {^'^^g^^)n'^ 
choices of the s-tuple (wi, . . . , w^) the probability that G contains a clique of size s 

is at most (^'^^^■')?t,*~^2), which is o(l) for any s > 4. 
Example 2: uneven edge densities all over G. 

Suppose that there exist sets {Ai)i(zy(H)^ with Ai C Vi and \Ai\ = n/2 for all 
i G V{H), such that e{Ai,Aj) > n/A + Kn/y/d for each i ^ j, i,j G V{H). Then 
setting Xu to be {nh)~^^'^ if u is in IJj and — (ra/i)"^/^ otherwise, we obtain a 
balanced vector x with ||x||2 = 1 and {x,x)j^.j > 2K\/d, so X*{G) > 2K\fd. 

Here, for each choice of sets (Aj)iey(H); with Ai C Vi and \Ai\ = n/2 for all 
i G V{H), the probability that for all i,j G V{H), i j we have e{Ai,Aj) > 
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n/4 + Kn/\fd, is of order at most exp{~c{^1^^K'^n/d), for some constant c > 
(this is not hard to derive by hand; it can also be obtained straightforwardly from 
Proposition 13 in Section 4). \i K is sufficiently large, this bound is strong enough 
for a union bound to show that with high probability, there is no such choice of sets 

{^i)i(iV{H)- 

The preceding examples present two rather different structures within G, both 
of which give rise to large new eigenvalues, and show how the new eigenvectors are 
also rather different. We may also switch our point of view, first fixing a vector x 
(for the first example a vector taking the value 1 on a single vertex of each fibre 
l^j, = 1, . . . , s and taking the value — l[„gv"[sj]/(?i — 1) on other vertices U] for the 
second example a vector taking the value (n/i)~^/^ on n/2 vertices in each fibre and 
— {nh)~^/'^ on the rest) then asking what structure in G is required if | (x, x) ^.^ | is to 
be large for this specific vector x. This is essentially the perspective we will take for 
most of the rest of the paper. 

From this viewpoint, a possible cause for \*{G) being large consists of a vector 
X G X together with evidence, in the form of specified edge counts between subsets 
of vertices of G, that | (x, x)^^ | is large. For a vector x, z G [h] and G M and let 

Ai,w{,x) = {v ^Vi : x^ = w}, aj,«,(x) = |Aj,^(x)|. 

We write Ai^^ = y4j_^(x) and ai_^ = aj,^(x) when the dependency on x is clear. We 
say that the collection {cj^^ : z G [/;.], G M} is the type of the vector x, and denote 
this collection a(x). By the symmetry of the model, the probability that | (x,x)^j | 
is large should be a function only of the type of x. We will seek sets of constraints on 
the edge densities between sets corresponding to distinct Qi^^ and aj/^^/, which codify 
all possible ways in which a large new eigenvalue can appear. A type, together with 
a particular such set of constraints, will be called a pattern; the precise definition of 
patterns will appear later in the section. 

While this initial concept of a pattern is useful for demonstrating the idea we 
have in mind, a number of changes are needed before we can put it into play. The 
most fundamental issue is that we are trying to bound sup^.^^^^ (x, x)j^^, the supremum 
being over an uncountable collection. We will shortly show that for a moderate cost, 
the uncountable supremum in Fact 6 can be replaced by a supremum over a more 
tractable collection. Also, it turns out that for any vector x G X, the contribution 
to (x, x)^ made by entries Xij,Xi'jr whose weights differ by more than a factor of 
is negligible, which will allow us to further restrict the collection of types we 
need to consider. This is not a complete list of the required changes, but before 
proceeding too far into the argument it is useful to fill in some of these initial steps. 

Recall that M is the adjacency matrix of G, and let M = (jn{i,j),(i' j')) ,j')ev{G) 
be the matrix with 

{1/n if z i' 
otherwise. 

In other words, TTT^{i,j),{i',j') = ~g (^'^j')}; so for all x G M^^*^), we have 

{x,x)jj = E [{x,x)j^j]. If X is balanced on each fibre, i.e., satisfies ^^gy. x^, = for 
all i G [h], then (xi,Xi)^ = xl(Mx,) = x* ■ = 0. Thus, letting iV = M - M, for 
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such X we have 

(x, x) j^j- = (x, x) jyj — (x, x)jj = {x, x) jy.j . 

If, on the other hand, x is constant on fibres of G, then {x,x)^,j- = and so 

{x,x)j^ = 0. From this we easily obtain the following fact. 

Fact 7. X*{G) — sup^-g^f I {x,x) ^ I . 

Proof. The inequality A*(G) < sup^g^^ | {x,x)^ \ follows from Fact 6 since {x,x)j^ = 
{x, x)jy,j if X is balanced on fibres of G. On the other hand, any vector in x e X can 
be expressed as y + z, where y E X is balanced on fibres of H and z is constant 
on fibres of H. Since {y,z)j^ = 0, we then have (x, x)^ = {y,y)j^ = {y,y)j^,j, which 
proves the other inequality. □ 

By using instead of M, Fact 7 allows us to maintain the property of only 
considering "new" eigenvectors of M, without insisting that the vectors we consider 
remain balanced on each fibre of G. This turns out to be remarkably helpful. 

Next, let = {0} U {2'/{nhy/^ : z G N}, let 

Z = Ix eR^'^^^ : \\x\\l<10,^ V eV(G),x^ e D+, sup x^ <d- inf x^ I . 

(1) 

and let 

E* = {(xi,X2) G : Xi,X2 > 0,Xi/x2 G rf^/^)}. 

The following proposition, which is proved in Section 7, formalizes our discussion on 
"restricting the collection of types we need to consider" . 

Proposition 8. \*{G) < QGsup^g^ | {x,x)j^^^ \ +480^(1. 

A relatively straightforward step in the proof of Proposition 8, that will also turn 
out to be useful in another part of the paper, is to show that not much is lost when 
we restrict our attention to pairs of vertices whose weights in x differ by at most 
a multiplicative factor of y/d. This is the content of the following lemma, which is 
proved in Section 7. 

Lemma 9. For all y G R^^^\ \ {y,y)j^^^,^^, \ < 4:y/d\\y\\l 

We are now in a position to further elaborate on what will constitute a pattern 
of a large eigenvalue. Recall that for y G M^(*^\ for each i G [h] and each w G M, we 
have Ai^w{y) = {v E V{G) : y^ = w}, and ai^y^{y) = \Ai^^{y)\, and that a(t/) is called 
the type of y. We remark that if y E Z then a(?/) satisfies the following properties. 

1. For all w E R, w ^ D^, we have a^,^ = 0. 

2. For all i G V{H), J2 

3. There exists wq G D~^, wq > 0, such that a^,^ = unless wq < w < wod. 
4- J2^&[hlweD+^^('i,^ < 10. 
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We call a collection a = {ai^ui '■ i ^ V{H),w G M} a Z-type if it satisfies properties 
1-4, so that if ?/ G Z then the type a{y) of ?/ is a Z-type. Next, given y G M^^'^^ and 
a' G E{H), w,w' G M, let ei,^y,^„/(?/, G) = |eG(^i,t«(|/), -4i/_^/(y)|, and write 

e{y,G) = {ei^nj,i',w'{y,G) : li' G E{H),w,w' G M}. 
For all m' G E{H) and all G M, we necessarily have 

ei,u,,i',w'{y, G) G {0, 1, . . . , mm{ai^^{y), ai>^^>{y))}. 
A pattern is a pair (a, e), where a is a Z-type and 

e = {ei^w,i',w' '■ a' G E{H), w, w' G M} 
is a collection with ej^^^j/,^/ G {0, 1, . . . ,mm{ai^w,0'i',w')} for all ii' G E{H) and all 



Write = \ {0} = {27Vn/i : i G N}, and let T be the graph on vertex 
set : 2 G V^(i/),w G -D^°} with an edge {i , w) {i' , w') if z i' and w/w' E 



the factor 2 arising from the symmetry of the matrix A^. We say that a pattern 
(a, e) can be found in G if there exists y E Z such that a(y) = a and e(?/, G) = 
e. In this case we say that G contains the pattern (a, e), and call the collection 
{Ai,w{y)}{i,w)£Vir) a witness for the pattern (a, e). 

Proposition 10. For K > 0, z/A*(G) > 192{K + 3)Vd then a pattern (a, e) with 
potency at least K\fd can he found in G. 

Proof. By Proposition 8, in this case there exists y E Z with \{y,y)N,E*\ > 2K\/d, 
and the result follows from (2). □ 

Our main aim, then, is to show that with high probability, no high-potency pattern 
can be found in G. To do so, we shall use a reduction which is conceptually analogous 
to how one bounds the probability that a fixed graph H appears as a subgraph of 
the Erdos-Renyi random graph G{n,p): rather than proving the bound directly for 
H, one instead considers a strictly balanced (maximally edge-dense) subgraph of H. 

Given a pattern (a, e) and a set of vertices S C V{r), we define the sub-pattern 
of (a, e) induced by S to be the pattern (a', e') obtained from (a, e) by setting 



w,w' G M. 




and call p(a, e) the potency of (a, e). We remark that for any y E Z, 




2p(a(y),e(y,G)) 



(2) 




otherwise. 



if {i, w) E S 
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and setting 

otherwise. 

We write (a, e)^ for the sub-pattern of (a, e) induced by S. 

We also require the following variant of potency, which allows us to consider a 
"maximally potent" subgraph of F rather than F itself. For a pattern (a, e) we define 



n((a, e)) := max 



{j,-u;)(i',io')e-B 

We will prove the following theorem. 



WW I Ci ,„ ii ,„> 



n 



Theorem 11. Fix L > 20. For any pattern (a, e), there exists S = S'((a, e),L) C 
V^(F) such that the following properties hold. 

p((a,e)5)>%^-55Lv^ 



P{(a, e)5 can be found in G} ^ Y[ ^t!w | Y\. 



1-L/W 

n ^ 



We prove this theorem in Section 5. Theorem 1 will follow straightforwardly from 
Proposition 10 and Theorem 11. More precisely, Proposition 10 ensures that the 
number of events whose probability we must bound is not too large, and Theorem 11 
ensures that the probability of each event is sufficiently small that we can simply 
apply a union bound to prove Theorem 1. Before providing the proof, we state 
one additional, easy bound which we will require, on the total number of patterns 
satisfying a constraint on the sizes of the ai^w- 

Lemma 12. For fixed A G N, the number of patterns with ai^^ < A for all {i,w) G 
V(r) is at most log2(n/i) ■ ^^'^'^^"Sad. 

Assuming this lemma, which is proved in Section 7, we are now ready to give our 
proof of Theorem 1. (The proof of Theorem 2, while conceptually almost identical 
to that of Theorem 1, ends up requiring a somewhat more technical development 
and we therefore defer it to Section 6.) 

Proof of Theorem 1. Fix M > 430656 = 192(2240 + 3), let K = K{M) = 
(M/192) - 3 > 2240, and let L(M) = A7112 > 20. 

By Proposition 10, if X*{G) > MVd then there is a pattern (a, e) with potency 
at least Ky/d = 112L\/d such that (a, e) can be found in G. Let (a, e)5((a,e),L) be 
the sub-pattern of (a, e) obtained by applying Theorem 11 to (a, e), it follows that 
p((a, e)5((a,e),L)) > (112/2 - 55)Lv^ = Ly/d and that (a, e)s({a,e),L) can be found in 
G. 

We say a pattern (a, e) is an L-reduction if there is a pattern (a', e') and S = 
S {{a' , e')) , L) as in Theorem 11, such that {sl' ,e')s{(a.',e'),L) = (a, e). It follows from 
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the preceding paragraph that for any M > 430656, 

p|a*(g) > MVd^ < P{^}, 

where E = E{M) is the event that G contains an L(M)-reduction (a, e) with 
e) > L{M)\fd > 20\/d. We use Theorem 11 to bound this probabihty for each 
fixed pattern (reduction) (a, e), the proof is then completed with a union bound. 
We now give the details. We split into two cases, depending on whether the pattern 
(reduction) (a, e) has > Ahdlog2d for some {i,w) G V{r). Let Ei = Ei{M) be 
the event that G contains an L-reduction (a, e) with p(a, e) > L^/d in which at least 
one tti^w > 4hdlog2d and E2 = E2{M) the event that G contains an L-reduction 
(a, e) with ^(a, e) > L\/d in which a^^^ < Ahd log2 d for all i, w. We note that Ei and 
E2 need not be disjoint. To prove Theorem 1 it suffices to show that both P {Ei} 
and P {E2} are less than ^/2 for all n sufficiently large. 

First note that for any L-reduction (a, e), writing a = J2{iw)ev{r)^i,w^ since 
1 — L/10<— 1, by Theorem 11 we have 

P {(a, e)s can be found in G} < JJ af/j ■ ( ■ 

{i,u,)es \a/\in/^ij 

For any L-reduction (a, e) considered in Ei, we have a > 4hdlog2{d). We note that 
since non-zero entries in any Z-type differ by a factor of at most d, for any i G V{H) 
there are at most log2(2(i) values w 7^ for which aj „, ^ 0. It follows that in total 
there are at most h\og2{2d) vertices {i,w) G V^(r) with a^^^ 7^ 0, so 



P {(a, e)s can be found in G} <^ n 



4:hd\og2{d)^ 
8hd\og2d-' 



n 



We next take a union bound over all L- reductions considered in Ei. Clearly the num- 
ber of such reductions is at most the total number of patterns, which by Lemma 12 
applied with A = n is at most \og2{nh)n'^^'^^°^'^'^ . It follows that 

P{B.}«(8Mlog,d)-— '^i^gL. 

and so P {L'l} < ra~^'^^''^/2 for n sufficiently large. 

Next, for any L-reduction (a, e) considered in L^2, we have 

n aj^^ < {Ahd\og2df''/'^^'''^-^''\ 

{i,w)£S 
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Also, for any pattern (a, e), it follows straightforwardly from the definition of a 
pattern that 



n 



p(a, e) < WW 

< ww' ram(ai^w, o.i',w' 



< 



{i,w)&V{r) (i' ,w')'^r{i,w) 
w'<w,ai ii,>0 



(i',«i')ev(r) {i,w)<^E{v) 

{i,w)(iV{T) 



It follows that a = ^ 



> > 3(i^/^ for every L-reduction (a, e) with 



p(a, e) > L\fd. So, for any pattern considered in £'2, for n > Qd^/"^ we have 



n 



a A [n/2\ 



-1 ^ (6rfl/2)3dV2 



n 



Furthermore, the total number of L-reductions considered in E2 is at most the total 
number of patterns with all a^^^ less than Ahd logg which by Lemma 12 is at most 
log2(n/i)(4/i(ilog2 (i)^'^'^'°S2'^. Thus, by a union bound, 

P{E2} « {AhdXog^df^"'^^^" ■ {Qd^/^f'" ■ 



which implies that P{i?2} < n '^'^^''^ /2 for n sufficiently large. This completes the 
proof. □ 



4. Two TOOLS FROM PROBABILITY 



In this section, we establish all the probability bounds we will require in the 
remainder of the paper. Proposition 13, below, bounds the probability that a random 
matching has certain prescribed edge counts between given pairs of sets. Lemma 15 
gives a lower bound on the integral of a function when its value is not too small 
relative to another function. We proceed immediately to details. 

Let V = {vi, . . . ,Vn}, W = {wi, . . . ,Wn}, and let M be a uniformly random 
matching of V and W. Let Aq, . . . ,As and Bq, . . . ,Bt be partitions of V and W 
respectively, and write = \Ai\ and bj = \Bj\. (Here s and t are constants not 
depending on n.) Writing e-^iAi, Bj) for the number of edges from Ai to Bj in M, 
it is straightforward that Hij = E [e{Ai,Bj)] = aibj/n. 

Now fix integers {eij}o<i<s,o<j<t with Yl]=o^ij ~ e&ch < i < s and 

^^=0 ^ij ~ ^3 ea.ch. < j < t. (From now on, summations and products over i 
(resp. j) should be understood to have < i < s (resp. < j < t).) We wish to 
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bound 



p|n{e(A,5,) = e.,}| 



Our aim is to prove a bound not too different from wliat we would obtain were the 
e{Ai, Bj) independent with Binomial(ajfej, 1/n) distribution. By direct enumeration, 

P {nij{e(^i,^i) = eijlj equals 



-1 

with the convention that 0! = 1. Applying Stirling's formula, this is of the same 
order as 

Mi^V". n ^T' (3) 

Now write Cij = {aihj/n){l + e^j) = + eij). We then have 



= U^r''^"'^ n (1 + 



n 

3 



ij ij ij-eijf^O 

For fixed i, since Cjj = Oj we must have ejj/ijj = 0. Likewise for fixed j we 
have J2i^ijf^ij = O5 it follows that 

n''r"=n(n(^)'""'"'nf'"''" 
=n(n''r")=i- 

A similar calculation shows that 

i j 

and so (3) equals 

1/2 

/ llia^ll,OJ \ 

n 
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We now focus our attention on the latter product of the preceding equation. It is 
helpful to multiply and divide by 1 = Ylij e'^'^^^^ , to obtain the equivalent expression 

JJ exp + eij) log(l + eij)) JJ exp (nijeij) 

= Yl exp + ejj) log(l + eij) - eij]) JJ exp (-^Uij) . 

ij-.Cij^O ij:eij=0 

(For the second product we use that when e^j = 0, e.y = —1.) Since (1+e) log(l+e)— e 
approches 1 as e J, —1, taking log — = 1 by convention then allows us to combine 
the two preceding products. In sum, we have established the following proposition. 

Proposition 13. Let e^j = + ejj) and let = CMi^i, Bj) . Writing x = 

n-V2 f n.a.n, 6. y^', we have 



P I f]{E,, = e,, 



ij) ^ij] 



In fact, it is a weakening of Proposition 13 that will be useful later in the paper. 
Corollary 14. Under the conditions of Proposition 13, we have 



p|n{^^^=^^^}l« ( n n 

I ij ) \l<i<s l<j<t / 



1/4 

. EijMij[(l+<:y)log(l+eij)-eij] 



Proof. For each < i < s we have no<j<t:e,,^o ^ij > «i, so nij:e,,^o > Hi ^i- We 
likewise have Hij-e^.^o ^ij ^ Ylj ^i' and so 

n e«>n«;'^n''f' 

zjieijj^O I j 

Also, n'^/^aQ^^^y^ < 1. The result follows. □ 

We will also use a lemma which is essentially a version of the following basic fact: 
if X and Y are non-negative random variables and E is the event that X < cY, then 
E [Xl[£;]] < cE [Y]. (For generality we shall state the lemma in terms of a measure 
space, although we shall only apply it to finite measure spaces, and in this case the 
reader may think of the integrals as simply weighted sums.) 

Lemma 15. Let v) he a measure space and g,h : Q ^M. positive measurable 

functions, and fix < c < 1. Writing 

E={ujen:^>J^'^^ 



g{uj) Jgdfi 



we have 



hdfi > (1 — c) / hdfi. 
E Jn 
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Proof. 

I hdjj, = I g ■ — d/i < c I g ■ ^^rf/i < c ( hdjj 
Jn\E Jn\E 9 Jn\E J 9 Jn 

□ 

5. Reduced patterns: a proof of Theorem 11 

Theorem 11 requires us to find, given a high potency pattern (a, e), a sub-pattern 
(a, e)s which is unhkely to occur in G. In fact, our aim will be slightly more exigent. 
We will show the existence of sub-pattern (a, e)^ which is 'locally unlikely' at all 
{i,w) G S. Given a pattern (a, e), for i i' and w,w' G M write ^i^w,i',w' = 
0'i,wO'i' ,w' / and write 

1 if tti^w = or tti/^w' = 0, 

Gi,w,i',w'/ fl'i,w,i',w' "1 if 0'i,w 7^ and tti'^u]' 7^ 0. 

The quantity ei^w,i',w' encodes the deviation from the expected number ^i^w,i',w' of 
edges between sets A G Vi, A' C Vi' with \ A\ = ai^w and \ A'\ = ai'^w' if \Eg{A, A') \ = 
e-i,w,i',w'- We will bound the likelihood of such deviations using Corollary 14, and to 
that end define a function h : (—1, oo) — > [0, oo) 

6(e) = (l + e)log(l + e) -e eG(-l,oo). 

We then have the following. 

Proposition 16. Fix L > 20. For any pattern (a, e), there exists S = ^((a, e), L) C 
V{r) such that the following properties hold. 

T)( Si G ) 

p{{ai,e)s)> — ^ 55Lvd and 

6(6^,^,^/,^/) > -^ai,^log (—] for all{i,w) G S . 
n lU \CLiw J 

The second inequality in Proposition 16 encodes the idea that the pattern is 
everywhere locally unlikely. Before continuing to the proof of Proposition 16 let us 
show that Theorem 11 does indeed follow from Proposition 16 and the probability 
bound Corollary 14. 

Proof of Theorem 11. Given L > 20 and a pattern (a, e), let S = S{{a,e),L) be the 
subset of V{r) given by Proposition 16. Since it is immediate from the statement 
of Proposition 16 that 

p((a,e)5)>^^^-55Lv^, 
all that is left to prove is that 

I-L/IO 

n 



P {(a, e)^ can be found in G} < JJ afj j JJ ( 



w A I n/2 I 

{i,w)eS \{i,w)£S ^ ' L / J, 

We recall that finding a copy of (a, e)^ in G corresponds exactly to finding a witness, 
i.e. a collection of sets {^i,i«}(j,«;)e5; such that for each {i,w) G S the set Ai^^ C 
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Vi has cardinality ai^u, and such that e{Ai^, Ai> = ei^i'w' whenever {i,w) ~r 
Since there are at most 

n \ai^A\n/2\) 

(i,w)£S ^ ' L / J/ 

choices of the collection {Aj,u,}(i,«))es it suffices to prove that 

-L/IO 

I {i,i«)~r{i',w') J {j,«>)G5 \(j,w)6S ^ ^''^ L / J/ 

for each such collection {Ai,«,}{j,u,)e5. Furthermore, since the inequality (en/a)" > 
(aALn/2j) holds for all 1 < a < n, it suffices to prove that 

I (i,u))~r(*'i'''') 



< n ■ 

(j,-i«)6S 



■ ^ (^) 



for each such collection {Ai^w}{i,w)£S- 

The required bound now follows by applying Corollary 14 for each matching, then 
using the independence of the matchings and the second bound of Proposition 16. 

□ 

The rest of the section is dedicated to proving Proposition 16. We divide into two 
cases based on whether the potency of (a, e) derives mostly from large deviation 
terms (terms in which is particularly large) or small deviation terms. The 

reason for splitting the proof in this way is that the expression b{ei^w,i',w') behaves 
rather differently in the two cases, resembling e log e and respectively. 

We shall partition P into two subgraphs Pld = rLD(a, e) and Psd = rsD(a, e) by 
setting 

\/(Pld) = V{Tsb) = V{T) 

and defining 

^(rLo) = {(hw) ~r ii',u;') : ei,^y,^/(a, e) > - 1} , 
^(rso) = {(hw) ~r ii',u;') : -1 < ei,^y,^/(a, e) < - 1} . 

We have chosen — 1 as the cutoff between the large and small deviations regimes 
as a matter of technical convenience to do with the details of the proof. Note that 
the potency p(a, e) of a pattern (a, e) may be expressed as 



p(a, e) 



n 

(i,u!)~r («',"!«') 



E 
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We define now two variants of potency. 



PLDia,ej 



and 



E 

(j,u.)~rLD {^','w>) 



(i,«))~rgD{«',i«') 



WW ai,„ait ,ni 



n 



WW a^wai'w' 



^7. 7;f 7/ 1/;' 



n 



The latter expressions respectively capture the "large deviations potency" and "small 
deviations potency." Since p(a, e) < pLD(a, e) +psd(cI, e), for every pattern (a, e) 
we have that 

max{pLD(a,e),pgD(a,e)} > — ^ — . 
We shall prove the following propositions. 

Proposition 17. Fix L > 20. For any pattern (a, e), there exists S = S{{a.,e), L) G 
V{r) such that the following properties hold: 

PLD((a,e)5) >pLD(a,e) - 30Lv^, 

and, for all {i,w) G 5*, 



E 



• U H ^ log(l + £i,w,i',w') > -Ttti^w log 

n \ 2 / 4 \ a,- ,„ 



(j',-u)')GAfrm {i,w)nS 

Proposition 18. Fix L > 20. For any pattern (a, e) , there exists S = 5'((a, e), L) C 
V(r) such that the following properties hold. 

> psD(a, e) - hhL\fd 



E 

{i',«)')eAfrsD {hw)r\S 
Since 



PsD((a, (i)s 



f2 

i,w,i'w' 



n 



15 



> — Oi,^ lo 



10 



en 



and 

for all{i, w) & S . 



J 15 if e < — 1 

- ^ (l + |)log(l + e) ife>e2-l, 



(4) 



Proposition 16 follows immediately from Propositions 17 and 18 by applying the ap- 
propriate proposition to the pattern (a, e) (i.e. applying Proposition 17 if pLD(a, e) > 
p(a, e)/2 and applying 18 if psD(a, e) > p(a, e)/2). The required bound on p((a, e)^) 
is obtained by considering the set E C E{T) given hj E = FldU or by = Fgols, 
as appropriate. 

The proofs of Proposition 17 and Proposition 18 are similar. In both cases the set 
S is found by repeatedly applying some straightforward reduction rules. In spirit, 
these rules can be understood by analogy with the following procedure. Suppose we 
are given a graph F = (V, E) with average degree fi. To find an induced subgraph 
with minimum degree at least fi/2, we may simply repeatedly throw away vertices of 
degree less than fi/2 until no such vertices remain. Throwing away vertices of such 
small degree can only increase the average degree in what remains, so this procedure 
must terminate with a non-empty subgraph satisfying the desired global minimum 
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degree requirement. In this example the measure of 'importance' of a vertex is its 
degree. When we apply a similar style of argument, the analogue of degree will be 
"local potency" , suitably defined. Also, our rules for throwing away vertices will be 
more involved, and so it will take some work to verify that in throwing away vertices 
we do not decrease the overall potency by too much. 

We now proceed to the proofs of Propositions 17 and 18. We prove Proposition 17 
first since the reductions required for its proof are simpler. 

5.1. Large deviation patterns: A proof of Proposition 17. Now and for the 

remainder of Section 5.1, we fix L > 20 and a pattern (a, e). Our aim is to find 
S C ^(r) with pld((cI-, e)s) > PLD(a, e) — 30L^/d and such that the inequality 

(l + 2)log(l + e)>-a,.log^— J 

holds for all {i,w) G S. 

As alluded to above, we shall choose 5* by repeatedly removing vertices of V{T) 
that correspond to "inconsequential" parts of the pattern, so that in the resulting 
sub-pattern (a, e) s there is a significant contribution to the overall potency from 
every vertex. We write 



PLD((a,e); (i^w)) 



n 



' ^i,w,i' ,w' 



for the 'local' large deviations potency at {i,w), and write 



(i' ,w')eNr(i,w) 



w' 



w 



for the amount of "L2-squared weight" of the type a that appears on F-neighbours 
of {i,w) weighted by the factor w'/wd}^'^. 

The intuition behind this weighting factor is that in the large deviations regime, 
the most significant contributions to potency should be made by edges (i, w){i', w') G 
-E'(rLD) with w and w' close to a factor of d}^"^ apart. Note that this is exactly what 
happens for the eigenf unctions of the universal cover (the infinite d-aiy tree) with 
near-supremal eigenvalues. 

We say that a set ?7 C V^(r) satisfies condition (LDl) if for each G t/ we 

have 

(LDl) pLD((a,e)c/; {i,w)) > Lai^u,w'^d^/'^ , 

and that U satisfies condition (LD2) if for each (z, w) E U we have 
(LD2) pLD((a,e)5; {i,w)) > LNi^^{s.,e)/d^/\ 

Condition (LDl) asks that pLD((a, g)s] {h w)) is large relative to the size of a^^^; the 
factor ai^ww"^ approximately measures the proportion of L^-squared weight of a used 
by tti^yj. In Condition (LD2) the factor a^^^w^ is replaced by Ni^^/d which (in some 
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sense) measures the average level of opportunity available to a^^^. (In the case of 
large deviations a good opportunity corresponds to {i',w') G Nr^j^{i,w) fl U with 
large ai'^w' large and such that the ratio w'/w is close to d^^"^.) 

We then let S = ^((a, e),L) C V{r) be the maximal subset of V^(r) satisfying 
both (LDl) and (LD2). The monotonicity of pld((cI, e)(/; (i, w)) in U guarantees 
that S is unique and that S can be determined by starting from V^(r) by repeatedly 
throwing away vertices that violate at least one of the two conditions, until no such 
vertices remain. In other words, writing k = |^(r)| — 15"!, we may order the vertices 
of V^(r) \ S* as {ii,Wi), . . . , {ik,Wk) in such a way that for each I < j < k, letting 
Sj = ViT) \ {(ii, wi), . . . , {ij_i,Wj.i)}, we have 

PLD((a, e)5^.; {ij,Wj)) < Lmax (^aj^^w^rf^/^, iVj^^/rf^/^j 

In other words, this ordering "verifies" that [ij, Wj) ^ S, for each 1 < j < k. 
Proving Proposition 17 now reduces to establishing the following two lemmas. 

Lemma 19. Let S = S{{a,e),L) C V(T) be as defined above. Then pi,D{{a, e) s) > 
PLD(a, e) — 30L^/d. 

Lemma 20. Given any subset U ofV{r), if U satisfies (LDl) and (LD2) then for 
all {i, w) G U, 

■ (^1 H ^ — j log(l + ej,^,i/,^/) > — Oj,^ log 

{i' ,w')€Nr{i,w)r\U 

Proof of Lemma 19. With (zi, wi), . . . , (zfc, Wk) and Si, . . . ,Sk as above, write 
Wi = {{ij,Wj) : 1 <j < k,pi,n{{a,e)s/,{ij,Wj)) < Lai^u,w'^d^''^} 

and write 

W2 = {{ij, Wj), 1 < J < A;,|>LD((a, e)^^.; (z^-, Wj)) < LNi,^/d^'^} . 
We note that Wi U W2 = {{ii, Wi), . . . , {ik, Wk)}. Since 

PLD((a,e)5) > pLD((a, e)) - ^ |)LD((a, e)^^; (z^, t^j)), 

i<i<fc 

to prove the proposition it suffices to show that 

Yl PLD((a,e)5,; itj,w,)) < 30Ld'/\ (5) 

First, for any 1 < j < k with {ij,Wj) G Wi, we have pLD((a, e)^^.; (zj, ujj)) < 
Lai^yjw'^d^^'^ , and so 

^ PLD((a,e)s^.;(^j,M;j)) < ^ La^^^w^d^'^ <lQLd^/\ 

l<j<k l<j<k 
{ij,Wj)&Wi {ij,Wj)&Wi 

the last inequality holding since a is a Z-type so w')ev{r) '^''^(^i',w' < 10. 
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w' 



w 



Now, for {ij,Wj) G W2, we have that pi^Y){{aL,e)s/i{ij,Wj)) < LNi^y^/ d}/"^ , where 
(we recall) 

(i' ,w')eNrii,w) 

Note that for any (i, w) E V{T) and any neighbour {i', w') of w) we have w' /wd}^"^ < 
1. Since all weights w are in D>°, it follows that 

for each {i',w') G V"(r). Thus 

(i,«))eVK2 (i,«))eVK2 {i',w')&Nr{i,w) ^ ^ 



^1/2 y,^l/2 
(i',-i«')eV(r) (j,-!«)6Arr(j',i«') 

where the final inequality follows from (6) and the fact that a is a Z-type. 

Since Wi U W2 = {(«i, Wi), . . . , {ik, Wk)}, the above bounds establish (5), complet- 
ing the proof of the proposition. □ 

In preparation for the proof of Lemma 20, we first record the following fact. Given 
U C V(T) and {i,w) G V(T), define 

The set Ui.Bih'w) contains the vertices we shall view as making an important con- 
tribution in our forthcoming use of Lemma 15. 

Lemma 21. If U C V(T) satisfies (LD2) then for all {i,w) G U, 

n 2 



UM^, nj) = \ (z', w') G iVr,, (z, w)nU: > 



EWW' aiyjail yjl 1 \ /■ \\ 



Proof Fix (z, u;) G U, let = iVr^Dl^, w) n f/, and let S = 2^. Then for (z', w;') G fi, 
set 

/ 12 I 

t ,w ) = ai^y^ai'^y,', h[t ,w ) = , g[t ,w ) = . . . 

n nd^i^ wd^i'^ 

Using (LD2), we then have 



and 



/ hd^i = pLD((a, e)u] (z, w)) > 



{i' ,w')^Nr^^{i,w)r\U 
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It follows that 



U^Uz,w) D i {^y) e Nr,,{z,w)nU: > ^^^^^ 



g{i',w') 2 J gd^x 
and so by Lemma 15 applied with c = 1/2, 

> > / ^hdfi>- hdfi= -pLD{{a,e)u;{t,w)). 

□ 

We now prove Lemma 20, completing the proof of Proposition 17. 

Proof of Lemma 20. We are given U C V^(r) satisfying (LDl) and (LD2), and aim 
to show that 

■ (^1 H — J log(l + ei,w,i',w') > -^ai,w lo" 

{i' ,w')&Nr{i,w)r\U 



Note that clogx > log(c^x) for x > and c > 1. Since 1 + ej^^^j/^^/ > for all 
{i',w') G A^rLDl^'""^) have that 

^^log(l + e,,^,,^0 > log (^(1 + e,„^,,^0 J J > log j . 

It follows that 



Oji^w^i' ,w' ( -. , ^i,wA'w' 



(l + ) log(l + ei,^y,„/) 



w'^d^/'^ n \ 

,2 



1 H ^ — log 1 + ei.w.i'.w'] 



- 2M^^ n log ^ (^,)2 ) ■ (7) 

Note that the expression inside the preceding logarithm is precisely the expression 
included in the definition of Ui^nih w). Applying first (7), then Lemma 21 and finally 
(LDl), we obtain that 

(^i,w(^i' ,U}' j I ^i,w,i',w' 

n 



[i' ,w')eNr^j^ {i,w)r\U 



> 



I / 2 J 

^i,wA'w' iOg 

n 



2^2^1/2 ^ 



L ^ f en 
> — a,- ,„ lo( 



4 Va,; 



completing the proof of the lemma. □ 
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5.2. Small deviation patterns: A proof of Proposition 18. Our proof of 
Proposition 18 is similar to our proof of Proposition 17 given above. In that spirit, 
throughout Section 5.2 we fix L > 20 and a pattern (a, e). We will find the required 
set S by repeatedly removing vertices of V{r) that make a small contribution to 
potency. 

Given a pattern (a, e) and {i,w) G V^(r), we write 



PsD((a, e); {i,w)) 



n 



WW ai,wai',w' 



for the 'local' small deviations potency at {i,w), and write 

Ni = Ni{a,e)= ^ w''^ai>^y,>. 

(i',w')£V(r):i'r^„i 

for the proportion of the L2-squared weight of a that appears on fibres Vi' that 
neighbour Vi. Our next aim is to state small deviations analogues of Lemmas 19 
and 20. However, our reduction rules are slightly more involved in this case, and 
require one additional definition. Let 

A'',- en 



Mi^yj = Mi J a, e) = max 



a,- nnw'^d ' a 



and let mj ,^ = (log Mj ,„)/M(^^. Note that we always have Mj^^ > en/ai^^ > e. Since 
the function xlogx"^ is increasing on (0,e~^] it follows that we may equivalently 
write 

Ni \ o.i,wW^d J Ni — en 



mi 



^ log ( if J_ < h!^ 

en \Cii,w I — Ni 



In what follows we will use that nii^^ < LISM^^''^ always holds. This follows from 
the fact that logx < l.lSx^^'^ on (0, oo). 

We say that a set f/ C V{T) satisfies (SDl), (SD2), and (SD3), respectively, if 

(SDl) psD((a,e)c/; > Lai^^w'^d^/'^ 

(SD2) psD((a,e)c;; {i,w)) > LNi{a,e)ai,^/{nd'/^), or 

(SD3) psD((a,e)t/; iz,w)) > LNi{a,e)mi^^{a,e)/d^/^ 

for all {i,w) G U. 

We remark that (SDl) is identical to (LDl), and asks that psd((cI, e){/; {i,w)) is 
large relative to ai^y^w"^. In (SD2), ai^^/n is the proportion of the fibre Vi consumed by 
a set of size aj,^, and A^i/d represents the 'average opportunity' in the neighbourhood 
of {i,w). Notice that unlike in the large deviations case, no factor of the form 
{w' /wd^/"^) appears. This corresponds to the intuition that in the small deviations 
case, large potency is most likely to come from large sets of roughly equal weight. 
Finally, (SD3), in which mi^u, appears, is a slight strengthening of either (SDl) or 
(SD2) - by a logarithmic factor - depending on which value mj^^ takes. 

Let S = S{{a,e),L) be the maximal subset of V(T) such that every {i,w) G S 
satisfies (SDl), (SD2), and (SD3). As in Section 5.1, writing k = |V^(r)| — 15*1, we 
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may order the vertices of V{r) \ 5 as (zi, Wi), . . . , {ik, Wk) in such a way that for each 
1 < j < ^5 letting Sj = V{T) \ {(ii, wi), . . . , {ij-i, Wj^i)}, we have 

In other words, this ordering verifies that {ij,Wj) ^ S, for each 1 < j < k. Proposi- 
tion 18 is an immediate consequence of the following two lemmas. 

Lemma 22. Let S = S{{a,e),L) be as defined above. Then we have psb{{^,^)s) > 
PsD((a, e)) - 55LVd. 

Lemma 23. Given any subset U ofViT), if U satisfies (SDl), (SD2), and (SD3) 
then for all (i, w) G U , 

n 15 10 \aiu] 

{i',w')eNrg^{i,w)nU ^ ' 

Proof of Lemma 22. Write 

Wi = {{ij,Wj) : 1 < j < k,psDiia,e)s/, (ij^Wj)) < Lai^^w'^d^^'^} 

W3 = {{ij.Wj), 1 < j < /c,psD((a,e)5^,; {ij.Wj)) < LNinii^^/ d^'"^} . 

Also, let = WsXWi. We note that for {i,w) G we have mj_^ > ai^ujw'^d/Ni. 
Since Mj^^ > e in all cases, we have that mj „, < and so 

a bound we will use later. 

Since W1UW2U = {{ii,Wi), . . . , {ik, Wk)} and 

PsD((a, e)^) > psD((a, e)) - ^ PsD((a, e)^^; {ij, wj)) , 

i<i<fc 

it suffices to prove that 

^ PsD((a,e)5^.; {ij,Wj)) < ddld^^'^ . 

(ij ,Wj ) 614^1 U14^2 Ul^g" 

The proof that 

J2 PsD((a,e)5,; {ij,w,)) < lOLd^s (9) 

is identical to that given in the proof of Lemma 19. 
For 14^2 we use the fact that for all i' G V{H), 

ai,w < nd , 

(i,w):i&Njj{i') 
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and proceed as follows. 

= lOLd'/^ (10) 

the last inequality holding since a is a Z-type so w')ev{r) '^'^(^i',w' < 10. 

The argument for VFg" is slightly more involved. We shall deduce our bound on 
X]fe,t«,)6M/3-PsD((a,e)5^-; {ij,Wj)) from the following claim. 

Claim: For each i G V{H) and each j > 1 there are at most (j — 1) vertices 
{i,w) e W7 for which M^,^ < 2^'. 

Proof. Fix j >1. First, by (8) we have that < Ni/{eai^^d). If Mi,„ < 2^ then 
by the definition of Mj^^, we also have Oj,^ > en/2-' , and so 

~ e^nc? nd 

Using the other term in the maximum which defines Mj^^, we see that if Mj^^ < 2-' 
then we also have 

W > —■ ; > 



2^ai^^d 2^nd 

Since if is a dyadic multiple of {nd)~^^'^ and our upper and lower bounds for w"^ are 
a factor of less than 2^-'"^ apart, the claim follows. □ 

2 /3 

By the above claim and the fact, noted earlier, that mj^^ < 1.18M^^ always 
holds, we have 

J2 rn,^< E l-18^C<E%77r <3-5- (H) 

w:{i,w)£W^ w:{i,w)eW^ i^l 

We are now ready to complete the proof of the lemma. We have 

(i',u>')ev(r) {i,w)ew.;[:i&NHii') 

(i',u.')eV(r) {i,w)&W^:i(iNH{i') 

< 35Ld^/^ , 

the first inequality following from the definition of W3, and the final inequality 
holding from (11) and since a is a Z-type. Combining this bound with (9) and (10) 
completes the proof. □ 
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The last step before proving Lemma 23 and thereby completing the proof of 
Proposition 16, is to establish an analogue of Lemma 21 for the small deviations 
case. In this case it turns out to be more straightforward to argue directly rather 
than to use Lemma 15. 

Given U C V{T) and {i,w) G V{T), let 
Lemma 24. For all {i, w) G U, we have 

> ^PsD((a,e)c/; {i,w)) . 



n 



Proof. Fix G U. Since 



by the triangle inequality it suffices to show that 



WW a.yjai'y,! 



n 



ww' aj^^ai' ^yj' PsD((a, e)u\ {i,w)) 

\(^i,w,i'w'\ _ „ 

n 2 



{i',w')&Us-D{i,'w) 

where UsDihw) denotes the set {Nrgj^{i,w) H U) \ UsDihw)- We observe that the 
above sum may be re-expressed as 



(i',«)')e6''sD(*,w) 



/2 

w aiiy,, 



\^i,w,i' ,w' I 



ww'n 



and that the bound 



\ei,w,i',w'\ ^ PsD((a, e)u] {i, w)) 



ww'n 



2w'^ai^Ni 



holds for all {i',w') G UsBihw). The required bound now follows immediately by 
noting that 



E 



w''^aiiyji 



< ^ < Ni 



the last inequality holding by the definition of A^j. 



□ 



Proof of Lemma 23. Fix U C V{r) satisfying (SDl), (SD2), and (SD3), and fix 
{i,w) G U. To prove the lemma, we must show that 

E(^i,w(^i' ,w' ^i,w,i',w' ^ L / 677. 

n 15 10 \a,iw 

{i',w')eNr^j^{i,w)r\U ^ ' 
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The proof divides into two cases depending on which value mj takes. Before 
considering these cases separately, we note the following inequality: for all {i,w) G S, 

,2 

^ n 15 

{i' ,w')&NTgj^{i,w)r\U 

n sr-^ 
~ 15 ^ 

ii' ,w')^Nt^j^ {i,w)r\U 



ww' tti^^ai' 


^i,w,i' ,w' 






^i,w,i' ,w' 






\ ww'n J 



> 



> 



> 



E 



15 n 

{i' ,w')eUsD{i,w) 



15 2w'^aiwNi ^ n 



the second inequality holding by the definition of UsD{i, w) and the third holding by 
Lemma 24. It therefore suffices to prove that for all {i,w) G S, 



npsD((a,e)[/; {i,w)y L 



QOw^a^.nNj 10 ' \a. 



en 



> — a,,^log . (12) 



First, if w^d/Ni < 1/en, then m^,^ = log (^-^2^). In this case, by (SD3), 

we have that 

PsD((a, e)^; (i, w)) > Lvo^ai^^S'"^ log ( ) > Lw;^a^,^„ci^/^ log ( — ) , 

where the final inequality follows since ai^^w'^d/Ni < ai^uijen. Combining this bound 
with (SD2) yields that 

(( \ (■ ,,2 ^ L^w^<^lwN, fen 
PsD((a,e)(7; > log 



n 

which verifies (12) in this case, since L > 20 > 6. 

In the remaining case we have 1/en < w'^d/Ni, from which it follows that mj^^ = 
ai^w^og{en/ai^w)/en. So, by (SD3), we have that 

PsD((a,e)[/; {i,w)) > ■ 

end^/"^ ■^^^(^''^/'^ijUi) • 
Combining this bound with (SDl), we obtain that 

PsDi{a.,e)u;{i,w)) > ■ log(en/ai,^), 

which verifies (12) in this case since L > 20 > 6e.This completes the proof. □ 
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6. Proof of Theorem 2 



Our proof of Theorem 2 is similar in nature to our proof of Theorem 1. As in 
that proof we shall use Proposition 10, which tells us that if X*{G) is large then G 
contains a pattern (a, e) of large potency. We require two other results. The first, 
which is easily proved, is an approximate converse to Proposition 10. The second, 
whose proof is more involved, is a variant of Theorem 11. 

Given a pattern (a, e) and a witness {Ai^w}{i,w)ev{r) for the pattern, write a = 
a(a, e) = Y.{i,w)ev{r) ^"^6 G[{^i,«;}(i,t„)ev(r)] for the subgraph of G induced 

U(i,to)ey(r) ^i,w- 

Proposition 25. If G contains a pattern (a, e) with j9(a, e) = p\/d then X*{G) > 
{2p-40)Vd. Furthermore, A(G[{A,,^}(,,^)ev(r)]) > (2p - 40)v^ - 



Proof. Let {Ai^^ : (z, w) G V^(r)} be a witness for (a, e), and let y G M"^ be defined 
by, for each {i,w) G V{T), setting y{v) = w for all v G Ai^y^ (and setting y{v) = 
for all remaining v). Since 

Wvlll = < 10, 

(i,«>)ey(r) 



by Lemma 9 we have 

\{y^y)N\ = \ (y^ y)N,E* + y)N.M?\E* \ >\(y^ y)N,u^\E' I - ^oVrf. 

But 2p(a, e) = | (y, y)i^^2\E* I' I y)N I — (2p— 40)\/rf, and the first result follows 
from Fact 7. 

For the second bound, write G' = G[{Ai^uj}(i,w)£V{r)], and write M' for the adja- 
cency matrix of G'. Also, let y' G be the vector obtained from y by retaining 
only coordinates corresponding to vertices of G' (recall that all other coordinates of 
y are equal to zero). Then 

I {y^y)N I = I {y^y)M - {y^y)M \ = I {y'^y')M' - (y^y)Ml 

and since all entries of M are either zero or l/ra. 



{y,y)M< -( Yl - 



n — ' n 



where in the last inequality we used that, given the constraints that y has a non- 
zero entries and \\y^ < 10, the sum ^CZliv<£\nh\ ?/«)^ maximized by taking all nonzero 
entries equal to lO^^^a"^/^. Thus 



I ,||2 ^ \y , y /A// - I \y, y/ATi i \y , y /m . ^ 
li/ II2 
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Theorem 26. Fix L > 20. For any pattern (a, e), there exists S = S{{sL,e),L) C 
V{T) such that the following properties hold. 

pi{a,e)s) >p(a,e) - IbOlVd 



P {(a, e)^ can be found in G} ^ Y\ 



ii,w)es 




I-L/IO 



n 



In the same way as Theorem 11 was deduced from Proposition 16 (together with 
Corollary 14) Theorem 26 may be deduced from the following proposition (Propo- 
sition 27). We recall from Section 5 the definition of b : (—1, oo) — j- [0, oo), 

6(e) = (l + e)log(l + e) -e ee(-l,oo). 

Proposition 27. Fix L > 20. For any pattern (a, e), there exists S = S{{a., e), L) C 
V{r) such that the following properties hold. 

p{{sL,e)s) > p{sL,e) — 150LVd and 



E 

(i' ,w')€NTii,w)nS 



n 



b{ei 



en 



for all{i, w) E S . 



Theorem 26 follows from Proposition 27 in exactly the same way that Theorem 11 
follows from Proposition 16. Rather than repeat this proof we refer the reader to 
the proof of Theorem 11 given in Section 5. 

We recall that our proof of Proposition 16 divided into two cases depending on 
whether pLD(a, e) > p(a, e)/2 or psD(a, e) > p(a, e)/2. These two cases were dealt 
with separately by Proposition 17 and Proposition 18 respectively. However, in 
Proposition 27 we seek a lower bound on p{{a,e)s) rather than on p{{a,e)s), and 
for this reason we can not treat the large and small deviations cases separately. In 
other words, we must work with the graph P rather than exclusively focussing on 
one of the subgraphs Pld, Tsd- That having been said, our proof of Proposition 27 
resembles the proofs of Propositions 17 and 18 in almost all other respects. 

We begin by establishing the following sufficient condition for the second bound 
of Proposition 27 to hold. For the remainder of the section fix a pattern (a, e) and 
a constant L > 20. In what follows we denote by 



p((a, e); {i,w)) 



{i' ,w')eNr{i,w) 



WW ai^^ai'^u,' 



n 



the 'local' potency at (z, w) 
defined in Section 5: 



and we recall the large and small deviations variants 



and 



PLD((a,e); {i,w)) 



PsD((a, e); {i,w)) 



E 

E 



WW a.^ai'y,' 



n 



WW ai^u,0'i',w' 
n 



-l,W,t' ,w' 
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We say that a set t/ C V{r) satisfies (Gl), (G2), (G3), and (G4), respectively, if 

(Gl) p((a,e)c/; {i,w)) > 2La,^^w^d^l\ 
(G2) p((a,e)c/;(^,^/;))>2LiV,,^/rfV2, 

(G3) p((a,e)c/; {i,w)) > 2LNiai,^/ {nd}/^), or 
(G4) p((a,e)c7; {i,w)) > 2LNimi,^/d'/\ 

for all € U, where Ni = A'j(a, e), A'j^^ = A^j „,(a, e) and mj „, = mj „,(a, e) are 

as defined in Section 5. 

Lemma 28. If U satisfies (G1),(G2),(G3), and (G4) then 

EO'i,wO'i'w' ,/ \ ^ ^ 1 I en \ 
o[ei^w,i',w') > log I 1 . 
Th XU \ Qjj 7/! / 

(i',to')6Afr(j,w)nC/ ^ 

Proof. Since E{V) = E(Ti^d) U i?(rsD) it follows from the triangle inequality that 
P((a,e)5; {i,w)) < pLD((a,e)c/; (z, u;)) + psD((a, e)^/; and so 

max{pLD((a,e)c/; (z, u;)),psD((a, ejf/; > . 

First suppose that pLD((a, e)(7; {i,w)) > p{{a.,e)u] {i,w))/2. In this case, (Gl) and 
(G2) imply that U satisfies conditions (LDl) and (LD2), and so, by an application 
of Lemma 20 we then have 

■ + — ^ — j log(l + ei,^,i/,^0 > -Oi,^ log I — 1 . 

The lemma now follows since (by (4)) h{ti^u,,i',w') > (1 + (-i^w,i',w' /2) log(l + ei^u,,i',w')- 
By Lemma 20 we then have 

■ ( IH ^ — ) log(l + ei^w,i',w') > —aLu, log I 1 , 

(note that the L of conditions (LDl) and (LD2) became 2L in conditions (Gl) and 
(G2)), which proves the result in this case. 

Otherwise, we must have that psD((a, (i, to)) > p((a, e)^/; (i, w))/2 and in this 
case the result follows similarly from (4) and Lemma 23. □ 

Now let S C V{T) be the maximal subset of V^(r) satisfying all of (G1),(G2),(G3), 
and (G4). The proof of Proposition 27 is then completed by the following lemma. 

Lemma 29. Let S he the subset defined above. Then 

P((a, e)s) > p(a, e) - 150Lv^ . 

Proof. This is proved exactly as Lemmas 19 and 22 were proved; we skip the details. 

□ 



We now have all the necessary preliminaries in place and we may turn to our proof 
of Theorem 2. 
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Proof of Theorem 2. We begin by noting that the star graph F consisting of a single 
vertex of degree d attached to d vertices of degree one is always a subgraph of G 
and has X{F) = \/d, which proves the result when X*{G) < 1 189248 

We may now suppose that X*{G) = My/d for some M > 1189248. It follows 
by Proposition 10 that G contains a pattern (a, e) of potency at least K\fd, where 
K = K{M) = (M/192) - 3 > 6191. By Theorem 26, applied with L = L{M) = 
K/151 > 41, we have that G contains a reduced pattern (a, e) with p(a, e) > Ly/d 
for which 

P{(a,e)canbefoundinG}« J] C( H Ca|1/2|)| ' ^^^^ 

where a = E(i,^)ey{r) 

Let {yli_^}(j,^„)gy(r) be any witness for (a, e)^, and write G' = G[{Ai_^}(i,^)6y(r)]. 
By Proposition 25 we have A(G") > (2L - 40)Vd - so for n > y/TOhd, if it 

happens that a < hd then A(G") > Ly/d > A*(G)/1189248, as required. 

To complete the proof we must bound by n~^'^ the probability that any reduced 
pattern (a, e) of potency at least L\/d with a > hd may be found in G. We remind 
the reader that for reduced patterns (a, e) we have the inequality (13). We split 
the proof of the required bound into two steps. First, recall the event Ei from 
the proof of Theorem 1, which was the event that any reduced pattern (a, e) with 
e) > Ly/d and for which ai^^ > Ahd\og2 d for some (i, w) G V{T), can be found 
in G. In proving Theorem 1 we showed that P {Ei} < n~^'^/2. 

Second, let E^ = E^{M) be the event that G contains a reduced pattern (a, e) 
with p(a, e) > Lyfd, with a = io)6S ^ ^"^^ with ai^^ < 4:hd\og2d for all 
{i,w) G V^(r). We complete the proof by proving that P {E^} < n~^'^/2 for all n 
sufficiently large. As in the proof of Theorem 1, by a union bound we obtain that 

P{E,{M)}<\og2{nh){4hdlog2dY^''°>^^^ 

Since hd + 1 < a = «;)e5 ^ (4^c^log2 (i)(/ilog2 (i), the following bound holds 
for all n > 2a: 

P{E,{M)} < log2(n/^)(4^dlog2C^)^^-^^°^^'^ ^^^^'|^+^P''^\ 
which is less than n~^'^/2 for n large enough. □ 



7. The remaining proofs 



Before proving Proposition 8 we establish the following, intermediate result, which 
contains our key convexity argument. For this step, it is notationally convenient to 
consider vectors x E M^^*^^ with ||x||2 = 0{nh) rather than 0(1). To this end, define 

D = {0} U {2V z G N} U {-2' : z G N} 

Z) = {0} U {2^ z G N}, 
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and let 



XII2 < nh] 



y\\l<hnh,^ V eV{G),y, e D} , 
y\\l < 10nh,\/ V e V{G),y^ E D} . 



Y = {ye M^(^) 
Y+ = {ye M^(^) 
Note that by Fact 7, nh\*{G) = sup^^x* I (^5^) at I- 
Proposition 30. nhX*{G) < 12sup^gy+ | {y,y),^ \ . 

Proof of Proposition 30. The proof is rather straightforward; it consists of first an 
averaging argument, and second a polarization argument. The polarization argu- 
ment becomes a little delicate only because we are trying to maintain the property 
that the entries of all vectors remain in D. 

For r G M let (j(r) = r/\r\ if r 7^ 0, and cr(r) = if r = 0. We call cr(r) the sign of 
r. Also, let £(r) be the greatest element of D which is less than or equal to r, and 
let u{r) be the least element of D which is greater than or equal to r. 

For X e My^'^\ write 

= {y e M^(^) : G ViG),a{y,) = a{x),y, G {i{x,),u{x,)}}. 

Now fix any x G X* and let y G F be randomly chosen as follows. Independently 
for each v G V{G): 

• if < < 1 let yi = cr(xi) with probability \xi\ and = otherwise. 

• if 2^ < \xy\ < 2^^^ for some j > 1, then let y^, = ay2^~^^ with probability 

{\xv\ — 2^)/2^'^^ and let y^, = av2^ , otherwise. 

By definition, all entries of y have values in {0, ±1, ±2, ±4, . . .}. Furthermore, for 
each V with \xv\ > 1 we have y^ < (2x„)^, and for i with \xy\ < 1 we have yl < 1. 
It follows that 

v£V{G) v£V(G) 

SO y is a random vector in y fl S^. Since, for each v G V{G), E [y^,] = Xy and the 
coordinates of y are chosen independently, we then have 



yP {y = y} 



X = 

so 

{x,x)^= {y^z)^P{y = y}P{y = z} . 

y,z&r\Sx 

In other words, is a convex combination of elements of {(y, z)^ : y,z E Y}. 

Choosing x G X such that | (x,x)jy | = nhX*{G), we then have 

nhX*{G) = \ {x,x) x^l < sup \{y,z)j^\. (14) 

y,zeYnSoo 

Note that for all y,z E Sx and all v G V{G), either y^ = or else {y^.z^} = 
{i{xv),u{xv)}. In particular, y^ and are either both non-negative or both non- 
positive. 
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Choose y,z G Y r\ for which the supremum in (14) is achieved, and write 
y = y+ — z = z+ — z^, where e.g. ?/+ is the vector obtained from y by replacing 
all negative entries of y by zeros. Then for all v G V{G), (j{y^) = and 
cr{y~) = (j{z~) We then have 

nhX*{G) < I {y,z)^ \ = \ {y+,z+)^ - {y+,z^)^- {y_,z+)^ + {y^,z^)^ |, 

so either one of | (?/+, z^)j^ \,\ (?/„, z+)j^ \ is at least nh\*{G)/6 or else one of | (?/+, -2+)^ 
I {y_,z_)j^ I is at least nh\*{G)/3. 

First suppose | {y^,z^)j^ \ > nh\*{G)/6. Since 

{y+ + z^,y+ + z^)^ = 2 {y+,z_)^ + {y+,y+)^ + {z_,z_)^, (15) 

it follows that either (?/+ + ?/+ + z^)j^ \ > nhX*{G)/9 or else one of | y+)j^ \ 
or I z^)j^ I is at least nhX*{G)/9. Also, since y,z E Sx, the non-zero coordinates 
of |/_|_ correspond to zeros in z_, we have y^ + z_ G Y~^. Since Hy+Hl, Ik-Hi; ^'^^ 
\\y^ + 2;-||2 are all at most ||y||2 + \\z\\2 < lOnh, in this case we have proved the 
proposition. This argument also handles the case \{y'_z^)j^\ > nhX*{G)/6, by 
symmetry. 

The other case is that | > nh\*{G)/3 (the proof in the case that 

I {y-,z_)j^ I > nh\*{G)/3 is symmetric). Since 

{y+ - z+,y+ - z+)^ = {y+,y+)^ + {z+,z+)^ - 2 {y+,z+)^, 

it follows that either one of {y+,y^)j^ , (2+, 2+)^ is at least nh\*{G)/12, or else 

{y+ - z+,y+ - z+)^ > nhX*{G)/2. 

In the former case we have proved the proposition. In the latter case, note that since 
for all V G V{G), either yy = Zy or else {yy, Zy} = {i{xy),u{xy)}, we have that all all 
entries of y+ — z+ are in D, and furthermore, for each v, either \yy,+ — Zy^^\ = or 
\yv,+ — | = 1 or \yy^+ — Zy^+l = \yy^+\ A \zy^+\. It follows that \\y^ — < 6nh. 
Now write w = y^ — z_^., and then separate w into its positive and negative parts: 
w = — . We then have 

nh\*{G)/2 < I {w,w)j^ I = I {w+,w+)j^- 2 + |. 

But we also have 

(w+ + + = 2 (w+,w7„)jv + + , 

and it follows that either one of | (^4., w+)jv I or \ w„)jy | is at least nhX*(G) /lO, 
or else | {w+ + W-,w+ + W-)j^ \ > nh\*{G)/10. Since all of ||ty+||2, ||w^_||2 and + 
tf„||2 are at most \\w\\2 < Qnh, they are all in and so the proof is complete. □ 

Proof of Lemma 9. For this proof write = {nyu])v,w&v{G)- For a given vertex v G 
V{G), there are precisely d vertices w G V{G) with Uyy, = 1 — 1/n, and [n — l)d 
vertices w G V{G) with Uy^ = —1/n (the remaining entries in row v oi N are all 
zero). Thus, for any v G V{G), 

1 1 

\yv nyy,yy,\^^^^yJ<d{l-l/n)-^ + {n-l)d{l/n)^ 

< '^.yl^Td. 
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Writing A = {{x,y) e : x > yVd}, it follows that 

\{y,y)N,m^\E'\ = '^\{y^y)N,A\^'^ Yl '^yl'/d = AVd\\y\\l. 

veV[G) 



□ 



Proof of Proposition 8. Choose z G for which | {z, z)j^ \ = \ supj^gy+ {y,y)j^ \ > 
nhX*{G) /12, possible by Proposition 30. Since ||2;||2 < lOnh, by Lemma 9 we have 

I {z, z)^^^^ I > {z, z)^ - AQnh\fd. 

Next, for each m e Nq let 1^ = E* n [rf'^/^, d(™+2)/2) x ci('"+2)/2) c M^^ and 

note that E* = IJmeN ^'^ triangle inequality 

oo 



Set 



m=0 



II .,112 ' ^ 



{.6y(G):.„e[d'»/2,d(™+2)/2)| ll^lli ' (I (^'^)7V,i?. \Pm) 



SO that 



Since no point in lies in more than two of the Im, we have YlmeNPm — 2, and so 
there must be m* G N for which > 1/2. Letting the the vector whose entry 
in position v is z^l^zyei^*], we then have that 

I 5 ^ ) N,E I ~ I ^) N,I^t I — Pm* I ^) N,E* 1/2; 

and furthermore, ||2;*||2 = PmlNlli- Let y be the vector obtained from z* by multi- 
plying all entries of z* . By 2l-~('°S2Pm)/2j ^ j^^^^g 

I I > ^2ik£^ > _ 5„,^ > ^ _ 5„,^_ 

and II1/II2 < \\z\\\ < lOn/i. Finally, recall the definition of Z from (1). Letting x 
be the vector obtained from y by dividing all entries by {nhY^"^, we obtain a vector 
a: G Z with | {x,x)j^^\ > A*(G)/96 — 5\/d, which completes the proof. □ 

We now give the promised proofs of Proposition 3 and Lemma 12. 

Proof of Proposition 3. Let a < n — hn^^"^ be the number of vertices in G', and let 
x' G M^^"^ ^ be an eigenvector of G' with eigenvalue A, chosen so that ||a;'||2 = 1. For 
each z G [h], let ti = J2veVinv(G') define a vector y G M^'^'^\ for all i G [h] and 

each V G Vi\V{G'), setting y^ = —ti/\Vi\V{G')\, and taking all other y^ equal to zero. 
Taking x = x' + y, it is immediate that x is balanced. Also, for v G V{G) \ V{G'), we 
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have \xv\ = < l/{n — a), and it follows that ||x||2 < 1 + nh/(n ~ a) < 1 + 1/A, 
since {n - a) > hn^^^ > {n(Pf/'^ > {ndXy^^. Now, 

{x, x) = {x, x) 

= + {y,y)^j + 2{x',y)^j 

= A + (y,y)jv^ + 2(x',?/)j^^. 

Since all entries of y have modulus at most {n — a)^^, it follows that | {y,y)^j \ < 
\E{G)\/{n—a)'^ = nhd/{2{n—ay) < ri^/'^d/ {2{n—a)). Similarly, since Y1,v<^v{G') l^f I — 
a^/^, we have 2| {x' ,y)^,j- \ < 2a}/'^d/{n - a). It follows that 

, , , , , d(n'/^ + 4ai/2) 5dr2i/2 ^ 5 

\(x,x),^\ > A ^ — z ^ — - > X z > A , 

' ^ ' ' - 2{n-a) - 2{n- a) 2' 

and recalling that ||x||2 < 1 + 1/A < A/(A — 1) completes the proof. □ 

Proof of Lemma 12. We may specify a pattern by first choosing wq, then choosing 
Cj^^ for each (i, w) with i G [h] and w G -D^° fl [wq, Wod], and finally choosing fi^w,i',w' 
for each pair {i,w), {i',w') G V^(r) with i i' ■ 

If the pattern is to be non-empty then there is some (i, w) G V{T) for which 
ai^wuh/w'^ is a positive integer. Since ai^y, < 1 it follows that <w'^ < nh and so 
there are at most \D^^ fl [0, v^r;7i]| < \og2{nh) choices for Wq. 

Having chosen wq^ since \D^^ fl [w;o; w7orf]| < log2(2(i), there are at most ^'^i°g(2'i) 
choices for the ai^w Finally, for each pair {i',w') with i i' ■, there are at 

most 1 + min(aj^^, Cj/^^/) < A choices for fi^w,i',w'- There are at most hd\og2{2d) /2 
such pairs, so the total number of choices for the fi^w,i',w' is at most y4'*'^ios2(2rf)/2_ 

Combining the bounds of the two preceding paragraphs, we obtain that the total 
number of patterns with ai^^ < A for all {i,w) G V^(r) is at most 

log2(n/l) ■ ^'^l°g2{2rf)^Mlog2(2d)/2 ^ iog2(n/i) ■ A^'"^^''^^'^. 

□ 
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